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SUMMARY 

Tic110 is a major component of tlie chloroplast protein import translocon. Two functions with mutually 
exclusive structures have been proposed for TicHO: a protein-conducting channel with six transmembrane 
domains and a scaffold with two N-terminal transmembrane domains followed by a large soluble domain 
for binding transit peptides and other stromal translocon components. To investigate the structure of 
Tic110, Tic110 from Cyanidioschyzon merolae (CmTicHO) was characterized. We constructed three frag- 
ments, CmTicHOA, CmTicHOB and CmTicHOc, with increasing N-terminal truncations, to perform small- 
angle X-ray scattering (SAXS) and X-ray crystallography analyses and Dali structural comparison. Here we 
report the molecular envelope of CmTicllOs and CmTicHOc determined by SAXS, and the crystal structure 
of CmTicHOc at 4.2 A. Our data indicate that the C-terminal half of CmTicHO possesses a rod-shaped helix- 
repeat structure that is too flattened and elongated to be a channel. The structure is most similar to the 
HEAT-repeat motif that functions as scaffolds for protein-protein interactions. 

Keywords: Cyanidioschyzon merolae, CMQ342C, chloroplast, translocon, TicHO, HEAT repeats. 



INTRODUCTION 

Chloroplasts are essential organelles in plants and perform 
many important functions such as photosynthesis, amino 
acid synthesis and nitrogen assimilation. Most chloroplast 
proteins are nuclear encoded, synthesized as precursor 
proteins with an N-terminal transit peptide, and imported 
into chloroplasts via the translocon complex located at the 
two envelope membranes of chloroplasts. Translocon 
components in the outer and inner envelope membranes 
are called Toe and Tic (translocon at the outer- and inner- 
envelope membrane of chloroplasts) proteins, respectively. 
Toc75, Toc34 and Tod 59 are the major Toe components. 
Toc34 and Toc159 are membrane-associated GTPases 
acting as receptors for precursors. Toc75 is a p-barrel 
membrane protein and functions as the protein-conducting 
channel across the outer membrane. Many Tic proteins 
have been identified. They are proposed to function in 
precursor protein transport across the intermembrane 
space, in inner-membrane channel formation or in redox 
regulations (for reviews, see Jarvis, 2008; Kessler and 
Schnell, 2009; Kovacs-Bogdan et ai, 2010; Li and Chiu, 
2010). Three stromal ATPases, Hsp90C (Inoue et ai, 2013), 
cpHsc70 (Shi and Theg, 2010; Su and Li, 2010), and Hsp93 
(CIpC; Constan et al., 2004; Nielsen ef a/., 1997) are shown 



to be important for driving precursor protein translocation 
into the stroma. 

Three proteins, Tic20, Tic21 and TicllO, have been 
proposed to function as protein-conducting channels 
across the inner envelope membrane. Tic20 and Tic21 are 
both about 20 kDa in size with four transmembrane heli- 
ces, similar to amino acid permeases (Kouranov et al., 
1998; Teng ef a/., 2006). Tic20 has been shown to form a 
cation-selective channel by itself (Kovacs-Bogdan ef al, 
2011) and a precursor-protein-sensitive channel together 
with three other Tic components (Kikuchi ef a/., 2013). 
Tic21 has been shown genetically to function in the same 
pathway as Tic20 (Teng ef ai, 2006). Furthermore, a com- 
plex containing Tic20, Tic21 and translocating precursors, 
but not TicllO, has been identified in chloroplasts (Kikuchi 
ef al., 2009, 2013). 

Tici 10 is an essential translocon component (Inaba ef al, 
2005; Kovacheva ef al., 2005). Two models have been pre- 
sented for its topology and function. The first model 
describes TicllO with six transmembrane helices (herein 
referred to as TM1 to TM6; Figure la, model 1) located 
throughout the polypeptide. The first two transmembrane 
helices (TM1 and TM2) function as a signal-anchor 
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Figure 1. Purification of the three CmTlcllO 
proteins. 

(a) Schematic representations of the two struc- 
tural models of TicllO and the corresponding 
regions covered by CmTicllOA, CmTicllOB and 
CmTlcllOc. Locations of the proposed trans- 
membrane domains (TMl and TM2 are repre- 
sented by yellow boxes; TM3 to TM6 are 
represented by orange boxes), transit peptides 
(TP, pink boxes) and domains for transit- 
peptide binding (green box) and (co) chaperone 
binding (purple box) are marked. 

(b) CmTic1 10 proteins were purified to homoge- 
neity. One microgram of purified proteins was 
analyzed by SDS-PAGE and stained by Coomas- 
sie blue. 

(c) Purified CmTlcllO proteins were analyzed 
by gel filtration. The chromatograph of each 
CmTlcllO protein is shown. 
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sequence to target the protein to the inner membrane 
(Lubeck et al., 1997), and the rest of the polypeptide 
traverses the inner membrane four more times (TIVI3 to 
TIV16) to form a Ca^*-sensitive and cation-selective channel 
(Heins et al., 2002; Balsera et al., 2009; Kovacs-Bogdan 
ef al., 2011). TicllO is therefore proposed to be the major 
channel for translocation of protein across the inner 
membrane. The second model also shows TicllO with a 
membrane anchor composed of TMl and TM2, but the rest 
of the protein is entirely soluble and localized in the stroma 
(Figure la, model 2; Inaba ef al., 2005, 2003; Jackson ef al., 
1998). The N-terminal portion of the soluble domain binds 
transit peptides directly (Inaba ef al., 2003). TicllO has also 
been shown to interact with the stromal chaperones Hsp60, 
Hsp93 and Hsp90C and the stromal domain of Tic40 (Kess- 
ler and Blobel, 1996; Kouranov et al., 1998; Chou et al., 
2006; Chu and Li, 2012; Inoue et al., 2013). TicllO is thus 
proposed to function as a scaffold in the stroma for binding 
transit peptides emerging from the inner-membrane chan- 
nel, and for recruiting/tethering other translocon compo- 
nents located in the stroma. 

Structurally the two models for TicllO are mutually 
exclusive. To investigate the structure of TicllO, we cloned 
and expressed the TicllO homologue from Cyanidioschy- 
zon merolae 10D, herein named CmTicllO, for structural 



studies. Cyanidioschyzon merolae is a model red alga liv- 
ing in acidic hot springs, and its genome has been 
sequenced (Matsuzaki ef al., 2004). Small-angle X-ray scat- 
tering (SAXS) analyses of recombinant CmTicllO frag- 
ments revealed an elongated shape in solution. 
Crystallography combined with Dali searches shows a 
HEAT-repeat structure for a C-terminal fragment of 
CmTicllO. These data indicate that the C-terminal part of 
TicllO possesses the structural characteristics of a scaffold 
for protein-protein interactions. 

RESULTS 

Sequence analyses of TicHO from Cyanidioschyzon 
merolae 

TicllO is conserved from glaucophytes, red algae to flow- 
ering plants and is almost always encoded by a single 
gene, except in organisms like Physcomitrella patens in 
which the entire genome was recently duplicated (Kalanon 
and McFadden, 2008; Shi and Theg, 2013). We cloned and 
expressed TicllO homologues from several flowering 
plants for crystallography studies, but without success. We 
then cloned and expressed the TicllO homologue from 
C. merolae. TicllO is encoded by a single gene {CMQ342C, 
http://merolae.biol.s. u-tokyo.ac.jp/) in the genome of 
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C. merolae 10D (Matsuzaki et al., 2004). We compared its 
polypeptide sequence (CmTicllO) witli TicllOs from the 
green alga Chlamydomonas reinhardtii (CrTicllO), the 
moss P. patens (PpTicllO), and the flowering plants 
Arabidopsis thaliana (AtTicllO) and pea {Pisum sativum, 
PsTicllO). Transit peptide processing sites and transmem- 
brane helices were predicted using ChloroP (Emanuelsson 
et al., 1999) and TMpred (Hofmann and Stoffel, 1993), 
respectively. As shown in Figure 2, CmTicllO is predicted 
to have a chloroplast-targeting transit peptide at its N ter- 
minus, followed by the two transmembrane helices agreed 
upon by both models (TM1 and TM2, Figure 2; shaded in 
yellow), and a large C-terminal region, similar to other 
TicllOs. As previously reported (Balsera et al., 2009), 
CmTicllO has similar secondary structure features to 
TicllOs from green plants except that it contains several 
insertions in the regions between TM2 and TM3, and 
between TM3 and TM4 of model 1. The C-terminal half 
contains a leucine-zipper-like motif (L-x-x-L-x-x-x-L-G; 
Figure 2) conserved in all TicllOs (Kalanon and McFadden, 
2008). Secondary structure prediction by PHYRE2 (Kelley 
and Sternberg, 2009) also shows that CmTic1 10 exhibits an 
all Q(-helix structure (green tubes in Figure SI) like other 
TicllOs. This result also agrees with circular dichroism 
spectra of TicllO from pea and Arabidopsis, both of which 
show a high a-helix content (Inaba ef al., 2003; Balsera 
ef al., 2009). 

CmTicHO is a membrane protein in cliloroplasts 

To confirm that CmTicllO possesses a transit peptide for 
chloroplast targeting, CmTicllO was synthesized by 
in vitro transcription and translation and incubated with 
pea chloroplasts under import conditions. The full-length 
CmTicllO was synthesized as an approximately 150-kDa 
protein (Figure 3a, lane 1, arrow). After import, it was 
processed to three smaller-sized proteins, all around 
130-140 kDa (Figure 3a, bracket). These proteins were 
resistant to thermolysin treatments performed on chlorop- 
lasts after import (Figure 3a, lane 4), indicating that they 
were inside chloroplasts. Chloroplasts after import were 
further lysed hypotonically and separated into membrane 
and soluble fractions. The two higher-molecular-weight 
imported proteins were in the membrane fraction. The 
smallest protein was in the soluble fraction (Figure 3a, 
lanes 5 and 6). This protein might be a degradation prod- 
uct after the loss of the transmembrane helices at the N 
terminus. These data suggest that CmTicllO possesses 
a functional chloroplast-targeting transit peptide and 
imported CmTicllO was similar in size to higher plant 
TicllOs and anchored in the membrane. 

Many smaller polypeptides were produced in the in vitro 
CmTicllO translation reaction (Figure 3a, lane 1). Efforts to 
reduce the number of these fragments were unsuccessful. 
To rule out the possibility that these fragments have con- 



tributed to the imported proteins observed, and to get a 
better estimation of the size of CmTicllO transit peptide, 
we constructed a C-terminally truncated CmTicllO, 
CmTicI 10D363X, in which the codon encoding Asp of resi- 
due 363 was mutated to a stop codon. CmTicI 10D363x 
was synthesized as a protein of approximately 42 kDa 
(Figure 3b, lane 1, arrow). After import, it was processed to 
a mature protein of 32 kDa. The mature protein was inside 
chloroplasts, judging from its thermolysin resistance 
(Figure 3b, lane 4), and was located entirely in the mem- 
brane fraction (Figure 3b, lane 6). This result confirms that 
CmTicllO could be imported into chloroplasts and further 
showed that CmTicllO has a transit peptide of approxi- 
mately 10 kDa, placing the processing site very close to the 
region corresponding to the processing site of pea Tici 10. 

Recombinant CmTicHOs without TIVI1 and TIVI2 are 
soluble proteins 

To characterize the properties of CmTicllO in solution, we 
designed a series of N-termlnally truncated constructs 
lacking TM1 and TM2 to produce recombinant proteins in 
Escherichia coli. As shown in Figures 1(a) and 2, 
CmTicI 10a, composed of residues 273-1218, corresponds 
to almost the complete region after TM2. CmTicI 10b, 
composed of residues 720-1218, further lacks TM3 and 
TM4 of model 1. CmTicI 10c, composed of residues 871- 
1218, contains the corresponding region from the middle 
of TIV15 to the C terminus. All of the constructs were 
successfully expressed and purified as soluble proteins 
from E. coli. The SDS-PAGE analyses indicated that they 
were purified to homogeneity (Figure lb). Size exclusion 
chromatography indicated that all three proteins formed 
one major conformation in solution (Figure 1c). These data 
agree with published results showing that pea Tici 10 with- 
out TM1 and TIVI2 was over-expressed as a soluble protein 
in E. coli (Balsera et al., 2009), and Arabidopsis TicllO 
without TMI and TM2 was over-expressed as a soluble 
protein in both E. coli and Arabidopsis chloroplasts (Inaba 
et al., 2003, 2005). 

Small-angle X-ray scattering analyses of CmlidlOs 

Since all the recombinant CmTicllO proteins were in solu- 
ble form, we first employed SAXS to determine their 
molecular shape in solution. Data were obtained for 
CmTicI 10b and CmTicI 10c but not for CmTicI 10a because 
CmTicI 10a was prone to aggregation. Values for the radius 
of gyration (Rg) for CmTicI 10b and CmTicI 10c obtained 
from a Guinier plot using the primus program are 66.5 and 
49.2 A, respectively (Table SI). The SAXS data and the 
GNOM curve fitting are shown in Figure 4(a,b). The maxi- 
mum dimension (Dmaxl- determined to be about 300 A for 
CmTicI 10b and 200 A for CmTicI 10c, was generated from 
the distance distribution [P(r)] functions (Figure 4c,d). 
Based on the scattering data, 10 independent models were 
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Figure 2. Sequence alignment and domain features of TicllOs. 

Ticl 1 0 homologues from Pisum sativum (Ps), Arabidopsis thaliana (At), Physcomitrella patens (Pp), Chlamydomonas reinhardtii (Cr) and Cyanidioschyzon mero- 
lae (Cm) were aligned by ClustalW2, generated from ESPript (Gouet ef a/., 1999) and further manually adjusted. Fully conserved residues are shaded in red and 
regions with similar residues are boxed in blue. The transit-peptide processing site in PsTicllO is indicated by a downward arrow. The transmembrane helices 
are shaded in yellow and orange. The region corresponding to the transit-peptide-binding domain of model 2 is boxed in green. The N termini of CmTicllOA, 
CmTicHOB and CmTicllOc are marked with rightward arrows. The C terminus of CmTicl10D363x is marked with a leftward arrow. The conserved leucine- 
zipper-like motif (L-x-x-L-x-x-x-L-G) is indicated above the sequence. 
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Figure 3. CmTicllO is imported into pea chloroplasts. 

(a) In v/fro translated [^^SlIVlet-CmTicllO (Tr, lane 1) was treated with therm- 
olysin directly (lane 2) or incubated with isolated pea chloroplasts (Cpt, 
100 |.ig chlorophyll) under import conditions for 30 min. After import, a 
small portion of the chloroplasts (10 |.[g chlorophyll) were centrifuged 
through a 40% Percoll cushion to re-isolate intact chloroplasts (lane 3). The 
rest of the chloroplasts were digested with thermolysin, and then recovered 
through a 40% Percoll cushion (lane 4). Some of the thermolysin-treated 
chloroplasts (81 |jg chlorophyll) were further lysed hypotonically and sepa- 
rated into membrane (M, lane 5) and soluble (S, lane 6) fractions by centri- 
fugation. Samples were analyzed by SDS-PAGE, Coomassie blue staining 
and fluorography. Twenty micrograms of proteins were loaded in lanes 3-6. 
The arrow indicates the full-length CmTicllO and the bracket marks the 
three imported and process CmTicllO inside chloroplasts. LS, large subunit 
of ribulose biphosphate carboxylase in the stroma, which serves as a 
marker for the soluble fraction; CAB, chlorophyll a/fc-binding protein of the 
thylakoid membrane, which serves as a marker for the membrane fraction. 

(b) Import of CmTicl 10D363x. The experimental conditions and lane desig- 
nations are the same as (a), except lane 5 is the soluble fraction and lane 6 
is the membrane fraction. 
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generated using the program gasbor without any symmetry 
constraint. These 10 models were averaged by damever. 
Surface envelopes calculated from the data showed that 
both CmTicl 10b and CmTicl 10c have an elongated zigzag 
shape (Figure 4e,f). The structure of CmTicl 10c can be 
fitted into the central portion of CmTicl 10b (Figure S2c), 
suggesting that the structure of CmTicl 10b is an extension 
of that of CmTicl 10c. In addition, the hydrodynamic radius 
(/?h) of CmTicl 10b and CmTicl 10c particles, determined by 
dynamic light scattering, was calculated to be 73 and 50 A, 
respectively (Figure S3), consistent with their fig obtained 
from SAXS. 

Crystal structure of CmTicHOc 

We next subjected the three CmTicllO proteins to crystalli- 
zation trials; however, crystals suitable for X-ray diffraction 
were only obtained from CmTicl 10c. The crystal structure 
of CmTicllOc was determined by selenium single-wave- 
length anomalous dispersion (Se-SAD) at a resolution of 
4.2 A. The electron density map and overall structure trac- 
ing are depicted in Figure 5(a). Our structure covers resi- 
dues 903-1213, missing the first 32 and the last five 
residues of the CmTicllOc construct. The structure 
comprises 14 a-helices of seven continuous anti-parallel 
helix-loop-helix pairs (Figure 5b), forming an elongated 
shape of about 96 x 28 x 15 A. Several loops linking the 
helices were not located (dashed lines in Figure 5b). Due 
to the low resolution, CmTicllOc is assigned as a poly-Ala 
structure, except for the six Met residues. According to 
model 1 of the TicllO structure (Figure la), the construct 
of CmTicllOc contains TM6 and part of TM5. The region 
corresponding to TM5 could not be observed in our struc- 
ture due to high flexibility in the N-terminal end. The 
region corresponding to TM6 consists of helix 3 and part 
of the loop connecting helix 4 in our structure (Figure 5b, c, 
colored in yellow). This region is 28.2 A high, much 
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Figure 4. Small-angle X-ray scattering (SAXS) 
characterizations of CmTicllOB and CmTicllOc. 
(a, b) Experimental X-ray scattering curves 
(black line) and the theoretical fitting curves 
(red line) of CmTicllOe and CmTicllOc were 
generated with gnom. The insets show the 
Guinier plots. 

(c, d) The distance distributions of CmTicllOe 
and CmTicllOc. 

(e,f) Averaged low-resolution envelope 
obtained from 10 independent ab initio models 
of CmTicllOe and CmTicllOc derived from the 
SAXS data. 
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smaller than the 60-80 A thickness observed for the chlo- 
roplast inner envelope membrane (Bisalputra and Bailey, 
1973; Heber and Heldt, 1981). Therefore, this helix is unli- 
kely to cross the inner membrane. 

To search for possible functions of the CmTicllOc fold- 
ing, we carried out a structure-based homology analysis by 
Dali (Holm and Rosenstrom, 2010) with the Protein Data 
Bank. No membrane protein with a similar folding was 
found. From the Dali results, the protein Sral in the WAVE 
regulatory complex (3p8c) (Chen et al., 2010), was identi- 
fied as the closest structural homologue with a root mean 
square deviation (rmsd) of 4.3 A over 145 residues (Ca 
atoms, Figure S4a). Other structural homologues include 
the cullin Cul4A in the damage-specific DNA-binding 
protein 1 (DDB1)-Cul4A-Roc1-SV5-V complex (2hye; rmsd 
of 4.0 A over 148 residues; Figure S4b) (Angers et al., 
2006), nuclear pore complex protein Nup107 (3i4r; rmsd of 
2.6 A over 70 residues; Figure S4c) (Whittle and Schwartz, 
2009) and the cullin Cull in the Cull-Rbxi-Skpl-F box^'^P^ 
complex (lldj; rmsd of 3.1 A over 77 residues; Figure S4d) 



(Zheng ef al., 2002). These proteins all show an elongated 
shape formed by helical repeats that can be classified as 
the HEAT-like repeat structure. The structure is usually 
used to form complexes with other proteins. The presence 
of HEAT-like repeat structure in CmTicllOc also agrees 
with a previous report indicating that pea TicllO has 
several predicted HEAT repeats (Balsera ef al., 2009). How- 
ever, the report only indicated six HEAT repeats located in 
the regions between TM2 and TM3, between TIVI4 and TIVI5 
and in the C terminus after TIVI6. We thus reanalyzed pea 
TicllO and CmTicHO using HHrepiD (Biegert and Soding, 
2008), a repeated-protein-motif prediction server specifi- 
cally designed for HEAT repeats. Because the program only 
allows 800 amino acids per entry, the last 800 amino acids 
of pea TicllO (residues 197-996) and CmTicHO (residues 
419-1218) were analyzed. The results showed that pea 
TicllO was predicted to have eight HEAT repeats located 
throughout the polypeptide (Figure S5) and CmTicHO was 
predicted to have 12 HEAT repeats (Figure SI, magenta 
tubes, and Figure S5). In the region resolved by our crystal 
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Figures. Structure of CmTicl 10c. 

(a) Electron density map of experimental phase 
at 1.5a and model building of CmTicllOc based 
on the six selenium sites (red balls 1-6 for sele- 
nium in Met917, Met940, Metl017, Metl020, 
Metl051 and Metl212, respectively). 

(b) A ribbon drawing of the CmTicllOc struc- 
ture. Helix 3 and part of the loop connecting 
helix 4 (indicated in yellow) is the correspond- 
ing region of TM6 in model 1 shown in Fig- 
ure 1(a). 

(c) A close-up view of the region corresponding 
to TM6 and its dimensions. All the figures were 
generated with pymol (http://www.pymol.org/). 
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Structure in CmTicllOc, HHrepID predicted four HEAT 
repeats located in helices 4-5, 6-7, 8-9 and 10-11. Our 
structure indicates that helices 5-6, 7-8, 9-10 and 11-12 
form four HEAT repeats. Even though the repetitive helix 
pairs are shifted by one helix, the prediction of the HEAT 
repeat structure is accurate. 

It has been shown that recombinant pea TicllO forms 
dimers in solutions. It has a calculated molecular mass 
around 100 kDa, but its retention volume in gel filtration 
experiments corresponded to a globular protein of 400 kDa. 
It was suggested that the higher than calculated molecular 
mass was due to a possible non-globular shape of TicllO 
(Balsera ef a/., 2009). Pea and Arabidopsis TicllO also 
migrated as dimers on blue native (BN)-PAGE when solubi- 
lized from chloroplast membranes (Kikuchi ef a/., 2009, 
2013). The calculated molecular masses of CmTicl 10b and 
CmTicllOc are 59.9 and 42.8 kDa, respectively. The molec- 
ular masses for CmTicl 10b and CmTicllOc estimated from 
our SAXS data were 172 and 79.6 kDa, respectively (Table 
SI), in agreement with our gel filtration data (Figure 1c). 
Thus it is most likely that both CmTicllOs and CmTicllOc 
also existed as dimers in solution. The CmTicl 10b dimer 
has a larger than calculated molecular mass, most likely 
due to its elongated shape. In the crystal packing, 
CmTicllOc formed an extended superstructure with tail-to- 
tail and head-to-head packing (Figure S2a). We fitted both 
the head-to-head and the tail-to-tail dimers with the 
CmTicllOc SAXS data using crysol (Figure S2b). The tail- 
to-tail dimer fits slightly better (x-value 6.412) than the 
head-to-head dimer (x-value 8.308). The chi-squared values 
are relatively high for both dimers, most likely because our 
crystal structure is a poly-Ala structure with no side chains. 
These data suggest that the SAXS envelopes we observed 
were formed by CmTicl 10b dimers and CmTicllOc dimers 
(Figure S2c), and CmTicllOs and CmTicllOc have a similar 



oligomerization state to pea and Arabidopsis TicllO in 
solutions. 

DISCUSSION 

All secondary structure predictions suggest that all TicllOs 
analyzed so far have similar structures. The biochemical 
properties of CmTicl 10 fragments are also similar to those 
shown for Arabidopsis and pea TicllO (Inaba eta/., 2003; 
Balsera ef a/., 2009). Although sequence comparison 
shows that CmTicl 10 has several insertions right after 
TM2 when compared with TicllOs from other species, 
these insertions are outside the CmTicllOg and CmTicllOc 
regions we analyzed. Thus it is most likely that TicllOs in 
higher plants also have a similar structure to CmTicl 10 in 
the regions we analyzed. 

Our SAXS results show that CmTicl 10b has an elon- 
gated and flattened conformation in solution. The crystal 
structure of CmTicllOc further shows an elongated helical- 
repeat structure. The dimensions of CmTicllOc are about 
96 X 28 X 15 A. Measurements of the chloroplast inner 
envelope membrane from various species have estimated 
that the thickness of the inner membrane is about 60-80 A 
(Bisalputra and Bailey, 1973; Heber and Heldt, 1981). The 
only way that CmTicllOc can cross the membrane is 
through the 96 A dimension, which will bury about six 
hydrophilic o(-helices in the membrane and is energetically 
extremely unlikely. However, in liposome floatation experi- 
ments, the region corresponding to our CmTicl 10b from 
pea TicllO was able to bind to liposome (the IVI2 fragment 
in Balsera ef a/., 2009). Because our resolution does not 
allow assignment of side chains, it is possible that this 
region may have some small patches of hydrophobic sur- 
faces that allow attachment to the neutral head group of 
phosphatidylcholine used for the liposome experiments. 
Phosphatidylcholine is not present in the chloroplast inner 
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envelope membrane (Douce and Joyard, 1990). Whether 
the M2 fragment will insert into liposomes with lipid com- 
positions of the chloroplast inner envelope membrane 
remains to be tested. When expressed in the chloroplast 
stroma, the entire region after TM2 of Arabidopsis TicllO 
exists as a soluble protein in the chloroplast stroma and 
did not insert into the inner membrane (Inaba ef al., 2003, 

2005) . 

In work showing that TicllO has channel activities in 
vitro, it was shown that the first three transmembrane 
domains in model 1 (TM1 to TM3) did not possess channel 
activity when reconstituted into liposome. The channel 
activity was provided by the last four transmembrane 
domains (TM3 to TM6) (Balsera et al., 2009; Kovacs-Bogdan 
ef al., 2010). Our structural data indicate that TIVI5 and TM6 
are unlikely to exist. We cannot exclude the possibility that 
the region not covered by our analyses, the TM3 and TM4 
regions of model 1, provided the channel activity observed 
in the reconstituted liposome. Of note is the fact that the 
six-transmembrane-domain topology of TicllO was 
deduced using isolated inner membrane vesicles (Balsera 
ef al., 2009), while experiments using intact chloroplasts 
have shown that the region after TM2 is entirely inside the 
stroma (Kessler and Blobel, 1996; Jackson ef al., 1998; Chou 
ef al., 2003; Inaba ef al., 2003, 2005; Kikuchi ef al., 2009). 
Isolated inner membrane vesicles may have mixed orienta- 
tions and may complicate the experimental results. 

Although the structure of the complete region after TM2 
is currently unavailable, the structures of CmTicllOB and 
CmTicllOc provide direct evidence showing that half of 
the region after TM2 has a HEAT-repeat structure with an 
elongated shape. All structural homologues we obtained 
from Dali also have elongated shapes, with the HEAT 
repeats forming the region for interacting with partner 
proteins in the same complex. For example, the CmTicI 10c 
structure aligns best to subdomain 4 within domain I of 
Sral. This part of Sral is composed entirely of HEAT 
repeats and provides the surface for interacting with Napl 
and WAVE1 of the WAVE regulatory complex (Chen ef al., 
2010). In another example, the N-terminal long stalk-like 
domain of Cull and Cul4A, both consisting of three HEAT- 
like repeats and the first few repeats, which aligned best to 
CmTicllOc, binds to the adapter protein Skpl in Cull 
(Zheng et al., 2002) and to DDB1 in Cul4A (Angers et al., 

2006) , respectively. Therefore, the shape and structure of 
the TicllO C-terminal region most likely also enable TicllO 
to function as a scaffold for interacting with other proteins 
in the chloroplast translocon complex. 

It was argued that Tic20 could not be the major protein 
translocation channel across the inner membrane because 
it was present at <1/20 of the molar ratio of TicllO 
(Kovacs-Bogdan ef al., 2011) and <1/10 that of Toc75 (Vojta 
ef al., 2004). However, a recent direct analysis has shown 
that the stoichiometry of Tic20 to Toc75 is actually about 



1:2.5 (Kikuchi et al., 2013). Furthermore, Tic20, together 
with three other essential Tic proteins, Tic56, TiclOO and 
Tic214, forms a precursor-sensitive channel when reconsti- 
tuted in planar lipid bilayers. In chloroplasts, these four 
proteins form a stable 1-MDa complex, and TicllO and 
Tic40 are not in this complex (Kikuchi et al., 2013). It is 
therefore most likely that the 1-MDa Tic20 complex forms 
the channel traversing the inner membrane, and TicllO, 
through binding transit peptides and scaffolding the vari- 
ous stromal chaperone motors and co-chaperone, func- 
tions in the next stage of the import process facilitating 
complete precursor translocation into the stroma. 

EXPERIMENTAL PROCEDURES 

Plant materials and growth conditions 

For growing pea seedlings (P. sativum cv. Little Marvel, De Bruyn 
Seed Co., http://www.debruvnseed.com/), the imbibed seeds were 
grown on vermiculite for 8-10 days under a 12-h photoperiod at 
20°C with a light intensity of approximately 150 |jmol m"^ sec"\ 

Plasmld construction 

Because the gene encoding CmTic1 10 contains no introns 
(MatsuzakI eta/., 2004), the coding region of CmTic1 10 was 
amplified from the genomic DNA of C. merolae 10D by PCR using 
the CmTicllO-FI and CmTicllO-RI primer pair (Table S2) and 
cloned into the pGEM-T vector (Promega, http://worldwide.pro- 
mega.com/). The sequence was confirmed by sequencing and the 
plasmid was named pGEM-T-CmTicllO. The plasmid was then 
used as a template to generate the three N-terminal truncation 
mutants by PCR using the following primers: CmTicI 10a-F1 and 
CmTic110R2 for CmTicllOA (residues 273-1218); CmTicllOg-FI 
and CmTic110R2 for CmTicllOe (residues 720-1218); CmTicllOc- 
F1 and CmTicI 10R2 for CmTicllOc (residues 871-1218). A A/del 
site and a X/iol site were added to the forward and the reverse 
primers, respectively (Table S2). The fragments were digested 
and cloned in the Nde\/Xho\ site of pET28a and the resulting 
plasmids were named pET28a-CmTic1 10a, pET28a-CmTic110B 
and pET28a-CmTic110c, respectively. The recombinant proteins 
produced have a Hise tag at both ends. CmTicI 10D363x was 
generated from pGEM-T-CmTicllO using the QuikChange II Site- 
Directed Mutagenesis Kit (Agilent Technologies, http://www. 
chem.agilent.com/store/Default.aspx) with primers CmTic1 10- 
D363amber-F and CmTicI 10-D363amber-R (Table S2). The frag- 
ment corresponding to CmTic110D363x was further amplified 
with primers CmTic110-F5-H/ndlll and CmTicI 10-R5-X£ial and sub- 
cloned into the H/ndlll/X/7ol site of pSP72 (Promega) to generate 
pSP72-CmTic110D363x. 

In vitro translation, protein import and post-import 
analyses 

I^^SIMet-CmTicllO and CmTicI 10D363x were in vitro transcribed/ 
translated in the TNT reticulocyte lysate system (Promega). Chlo- 
roplasts were isolated from pea seedlings as described (Perry 
eta/., 1991). Isolated chloroplasts were adjusted to 1 mg chloro- 
phyll mr'' in import buffer (330 mwi sorbitol, 50 mM HEPES-KOH, 
pH 8.0) for import assays. [^^SlMet-labeled precursor proteins 
were incubated with isolated chloroplasts in the presence of 3 mM 
ATP in import buffer at 25°C for 30 min. After import, intact chlo- 
roplasts were re-isolated through a 40% Percoll cushion at 4''C, 
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washed once with import buffer and dissolved in protein extrac- 
tion buffer containing 300 miui 2-amino-2-(hydroxymethyl)-1,3-pro- 
panediol (TRIS)-HCI, pH 8.5, 1 mM EDTA, 8% SDS (w/v) and 1 mM 
phenylmethylsulfonyl fluoride. Thermolysin treatment of chlorop- 
lasts after import was performed as described (Perry et al., 1991). 
To separate chloroplasts into soluble and membrane fractions, 
chloroplasts after import were lysed by a hypotonic buffer (25 mM 
HEPES-KOH, pH 7.5, 5 mM MgCIa) at a chloroplast concentration 
of 0.5 mg chlorophyll mT^ and then separated into soluble and 
membrane fractions by ultracentrifugation at 100 000 gfor 45 min 
at 4°C. Soluble fractions were precipitated by 10% (w/v) trichloro- 
acetic acid, washed with 100% ice-cold acetone and dissolved in 
protein extraction buffer. Membrane fractions were washed with 
the hypotonic buffer once, collected by another ultracentrifugation 
at 100 000 g for 15 min at 4''C and the pellets were resuspended 
by the protein extraction buffer. Samples were analyzed by SDS- 
PAGE, Coomassie blue staining and fluorography. 

Protein expression, purification and characterizations 

For protein preparations, pET28a-CmTic1 10a, pET28a-CmTic1 10b 
and pET28a-CmTic110c were transformed into the E. coli strain 
BL21(DE3). Protein expression was induced with 1 mM isopropyl 
P-D-1-thiogalactopyranoside after the ODeoo of the culture 
reached about 0.6. The cells were further cultured at 37''C for 
another 3 h. Bacterial cells were harvested by centrifugation at 
5000 g for 30 min at 4°C, resuspended in Ni-NTA column bind- 
ing buffer (20 mM TRIS-HCI, pH 7.8, 500 mM NaCI and 5 mM 
imidazole) and lysed under high pressure using a microfluidizer. 
The clear cell lysates after centrifugation were purified using 
Ni-NTA affinity chromatography (GE Healthcare, http://www3. 
gehealthcare.com/en). Proteins were eluted using step imidazole 
concentrations. The majority of these three proteins were eluted 
at 200 mM imidazole. Gel filtration experiments were performed 
on HiLoad 16/60 SuperdexTM 200 preparation-grade columns 
connected to an AKTA-FPLC system (GE Healthcare) in 20 mM 
TRIS pH 7.8, 500 mM NaCI and 200 mM imidazole. For the prepa- 
ration of the selenomethionyl derivative, pET28a-CmTic110c was 
further transformed into BL21 (B834) for SeMet-CmTicI 10c over- 
expression. The purification procedures were the same as 
described above. 

Small-angle X-ray scattering analyses 

Protein samples for SAXS analyses were collected from gel filtra- 
tion (Figure 1c) and the concentrations of proteins were deter- 
mined by the Bradford dye-binding method. The SAXS 
experiments were performed at the Beamline 23A at National 
Synchrotron Radiation Research Center (NSRRC) in Taiwan (Liu 
et al., 2009) using a MARCCD165 detector. The protein sample 
concentration was 1 mg mTV The X-ray energy was 14.0 keV and 
the collection time was 300 sec. The buffer for gel filtration 
(20 mM TRIS pH 7.8, 500 mM NaCI and 200 mM imidazole) was 
used as the solvent blank. Primary data reduction was done with a 
NSRRC 23A homemade program. The processed data were 
analyzed using the atsas package (Petoukhov et al., 2007): the 
radius of gyration (Rg) was calculated using primus, P(r) and D^a^ 
were calculated using gnom, low-resolution shapes of protein 
samples were determined ab initio from the scattering data by 
GASBOR and averaged by damever, the calculated scattering curve 
was fitted with the structure of CmTicllOc using crysol, and SAXS 
envelopes and crystal structures are superimposed using supcomb. 
The molecular mass of proteins in solution determined from 
SAXS was done by SAXS MoW (http://www.if.sc.usp.br/~saxs/ 
saxsmow.html). 



Crystallization of CmTidlOc 

Crystallization trials were setup with 6 mg mP^ protein using the 
hanging-drop vapor-diffusion method (McPherson, 1990). Initial 
screening was performed with a sparse-matrix screen from Hamp- 
ton Research (http://hamptonresearch.com/Default.aspx). Only 
CmTicllOc crystals were obtained by a mixture of equal volumes 
of protein solution and reservoir buffer containing 20 mM TRIS- 
HCI pH 8.0, 300 mM NaCI, 18% PEG400 (v/v), 15% glycerol (v/v) 
and 200 mM CaCl2. The crystals grew to maximum dimensions of 
0.1 X 0.1 X 0.2 mm within 3 days at 20°C. The selenomethionyl 
derivative crystals (SeMet-CmTicI 10c) were obtained under the 
same conditions. The crystals grew to maximum dimensions of 
0.1 X 0.05 X 0.2 mm within 1 week at 20°C. However, these crys- 
tals diffracted poorly. To improve crystal quality, dehydration in a 
higher concentration of the original crystallization conditions with 
the addition of 5% tacsimate (v/v) resulted in greatly improved 
crystal quality. The SeMet-CmTicI 10c crystal grown with this addi- 
tive diffracted to 4.2 A, and was used for data collection. The crys- 
tals belong to the P6522 space group with unit-cell parameters 
a = t) = 121.1 A and c = 242.4 A. A single wavelength was chosen 
for the data collection. A complete dataset was collected to a reso- 
lution of 4.2 A on an ADSC Quantum-315 charge-coupled device 
detector using a synchrotron radiation X-ray source at Beamline 
BL13B1 of the NSRRC. The data were indexed and processed with 
HKL2000 (Otwinowski and Minor, 1997). solve (Terwilliger and 
Berendzen, 1999) was used to locate the Se sites and generate the 
initial SAD phase at a 4.5 A resolution. The density modification 
was calculated with resolve (Terwilliger, 2000) using solvent 
flattening. XtalView (McRee, 1999) was used for examining elec- 
tron density maps and manual model building. Further refine- 
ments were performed using cns (Brunger et al., 1998). The final 
model for all reflections above 2a between 27.37 and 4.2 A resolu- 
tion was refined to 35.0% and a Rfree value of 36.6% was obtained 
using 10% randomly distributed reflections. The crystallographic 



Table 1 X-ray diffraction data and refinement statistics 



Crystal 


Se-Met derivative 


Space group 


P6522 


Wavelength (A) 


0.9787 


Resolution (A)" 


4.2 


Unit cell (A) (a/c) 


121.1/242.4 


Total no. of reflections 


102 418 


No. of unique reflections 


8171 


Redundancy 


12.5 


Completeness {%f 


99.7 (99.7) 




27.8 (4.1) 


ffsym (%) 


10.4 (53.2) 


Refinement: 




Resolution (A) 


27.37-4.2 


R-factor (%) 


35.0 


ffree (%) 


36.6 


Number of reflections used 


6673 


Number of atoms (non-hydrogen) 


1351 


r.m.s. deviation for bond length (A) 


0.011 


r.m.s. deviation for bond angle (°) 


2.5 


Average B-factors: 




All protein atoms (A^) 


74.6 



"Numbers in parentheses are for the highest resolution shell, 
'''^sym = ^\lh-<lh>\/T,lh, where <lh> is the average intensity over 
symmetry-equivalent reflections. 
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data and refinement statistics are shown in Table 1. The atomic 
coordinates for the CmTicllOc structure have been deposited in 
the Protein Data Banl< under the accession code 4bm5. 
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